WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau




PCT

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)



(51) International Patent Classification:

C07B 61/00, C07K 1/04, C12Q 1/68,
G01N 33/543, 33/551, 33/552, 33/544

A1



(11) International Publication Number: WO 00/09464 

(43) International Publication Date: 24 February 2000 (24.02.00)



(21) International Application Number: PCT/US99/18600

(22) International Filing Date: 16 August 1999 (16.08.99) 



(30) Priority Data:
60/096,820    17 August 1998 (17.08.98)    US



(71) Applicant: PHYLOS, INC. [US/US]; 128 Spring Street, Lexington, MA 02421 (US).

(72) Inventor: LOHSE, Peter; 28 Skahan Road, Belmont, MA 02178 (US).

(74) Agent: ELBING, Karen; Clark & Elbing LLP, 176 Federal
Street, Boston, MA 02110-2214 (US).



(81) Designated States: AE, AL, AM, AT, AU, AZ, BA, BB, BG,
BR, BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB,
GD, GE, GH, GM, HR, HU, ID, IL, IN, IS, JP, KE, KG,
KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK,
MN, MW, MX, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI,
SK, SL, TJ, TM, TR, TT, UA, UG, UZ, VN, YU, ZA, ZW,
ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ, UG,
ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ,
TM), European patent (AT, BE, CH, CY, DE, DK, ES, FI,
FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent
(BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE,
SN, TD, TG).



Published 

With international search report. 
Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: IDENTIFICATION OF COMPOUND-PROTEIN INTERACTIONS USING LIBRARIES OF PROTEIN-NUCLEIC ACID
FUSION MOLECULES



(57) Abstract 



Disclosed herein is a method for detecting a compound-protein interaction, involving: (a) providing a compound library in which 
each member of the compound library is immobilized on a solid support; (b) contacting each member of the immobilized compound library 
in a single reaction chamber with each member of a protein-nucleic acid fusion library under conditions which allow the formation of 
compound-fusion complexes; (c) isolating the immobilized compound-fusion complexes; and (d) detecting a compound-fusion complex as 
an indication that the protein of the fusion interacts with the compound. In preferred embodiments, the protein is identified by reading the 
nucleic acid portion of the fusion, and the compound is identified by reading a detectable tag bound to either the compound or the solid 
support. 
IDENTIFICATION OF COMPOUND-
PROTEIN INTERACTIONS USING LIBRARIES
OF PROTEIN-NUCLEIC ACID FUSION MOLECULES

Background of the Invention
In general, the invention features screening methods involving 
nucleic acid-protein fusions. 

Screening is considered to be an efficient tool to identify binding
interactions between proteins and small molecule compounds derived from
large pharmaceutically-based collections, new synthetic approaches such as
combinatorial chemistry, or natural sources (TIBTECH, vol. 13, p. 115, 1995).
However, the multidisciplinary nature of most screening techniques poses
significant challenges. The most important challenge of such techniques is
maintaining a ready supply of materials for the screen. Screening of small
molecule compound libraries with different protein targets requires sufficient
amounts of compound. Alternatively, screening of large compound libraries
(for example, having 10^ members or greater) requires large amounts of
recombinant protein. Another challenge is to operate the screen rapidly and
cost effectively. Screening of compound libraries with different protein targets
is generally time consuming if carried out in a sequential fashion.

Recently, a method has been described for the isolation of proteins with
desired properties out of a pool of proteins (Szostak et al., Selection of Proteins
Using RNA-Protein Fusions, U.S.S.N. 09/007,005, January 14, 1998, and
U.S.S.N. 09/247,190, February 9, 1999; and Roberts & Szostak, Proc. Natl.
Acad. Sci. USA (1997) vol. 94, p. 12297-12302). This technique is
accomplished by means of protein-RNA fusion molecules in which each protein is
covalently linked to its encoding RNA. The protein-RNA fusion technology
may be used to screen cDNA libraries and to clone new genes on the basis of
protein-protein interactions (see, for example, Szostak et al., Selection of
Proteins Using RNA-Protein Fusions, U.S.S.N. 09/007,005, January 14, 1998,
and U.S.S.N. 09/247,190, February 9, 1999).



Summary of the Invention 
The purpose of the present invention is to efficiently identify
protein-compound binding interactions (and, particularly, protein-small molecule
interactions) by screening small molecule compounds with libraries of
protein-nucleic acid fusions (for example, protein-RNA fusions) in a parallel fashion,
thus providing a catalogue of small molecule-protein pairs.

Accordingly, in a first aspect, the invention features a method for
detecting a compound-protein interaction, the method involving: (a) providing
a compound library in which each member of the compound library is
immobilized on a solid support; (b) contacting each member of the
immobilized compound library in a single reaction chamber with each member
of a protein-nucleic acid fusion library under conditions which allow the
formation of compound-fusion complexes; (c) isolating the immobilized
compound-fusion complexes; and (d) detecting the compound-fusion complex
as an indication that the protein of the fusion interacts with the compound.

In preferred embodiments, the protein-nucleic acid fusion is either a
protein-RNA fusion, a protein-DNA fusion, or a protein fused to a DNA-RNA
hybrid; the solid support is a bead; each bead is coded with a unique detectable
label; the compound of the complexed protein-nucleic acid fusion is identified
by the unique detectable label associated with the bead; the detectable label is a
peptide label, a nucleic acid label, a chemical label, a fluorescent label, or a
radio frequency tag; the solid support is a chip and the compound library is
immobilized on the chip in an addressable array; each member of the
protein-nucleic acid fusion library is detectably labeled; the compound-fusion complex,
or the components thereof, are recovered by release from the solid support; the
method further involves recovering the protein-nucleic acid fusion from the
solid support and identifying the protein; the identity of the protein is
determined from the sequence of the nucleic acid portion of the protein-nucleic
acid fusion; and the compound is a small molecule.
In a related aspect, the invention features a method for detecting a
compound-protein interaction, the method involving: (a) providing a compound
immobilized on a solid support; (b) contacting the immobilized compound with
a protein-nucleic acid fusion library under conditions which allow the fusion to
bind to the compound; and (c) detecting a bound protein-nucleic acid fusion as
an indication that the protein of the protein-nucleic acid fusion interacts with
the compound.

In preferred embodiments, the protein-nucleic acid fusion is either a
protein-RNA fusion, a protein-DNA fusion, or a protein fused to a DNA-RNA
hybrid; the protein-nucleic acid fusion is detectably labeled and the interaction
is indicated by the association of the detectable label with the solid support; the
bound protein-nucleic acid fusion is recovered by release from the solid
support; the method further involves recovering the protein-nucleic acid fusion
from the solid support and identifying the protein; the identity of the protein is
determined from the sequence of the nucleic acid portion of the protein-nucleic
acid fusion; the solid support is a column, glass slide, chip, or bead; and the
compound is a small molecule.

As used herein, by a "library" is meant a collection of at least two 
molecules (for example, molecules such as compounds or protein-nucleic acid 
fusions). A compound library preferably includes at least 10^ or 10^ members, 

and, more preferably, at least 10^, 10^, or 10^ members. A protein-nucleic acid
library (for example, a protein-RNA library) preferably includes at least 10^ or 
10^ members, more preferably, at least 10"^, 10^ or 10^ members, and, most 
preferably, at least 10*° or 10'^ members. 
By a "DNA-RNA hybrid" is meant a DNA strand hybridized to a
complementary RNA strand. Typically, the DNA strand is generated by
reverse transcription of the RNA molecule.

By "addressable array" is meant a fixed pattern of immobilized
objects on a solid surface in which the identity of the objects is known or can
be readily determined.

By a "small molecule" is meant a compound with a molecular weight 
of less than or equal to 10,000 Daltons, preferably, less than or equal to 1000 
Daltons, and, most preferably, less than or equal to 500 Daltons. 

The present invention provides a number of advantages. For
example, the present methods reduce the amount of material required for a
screen. In standard screens, considerable amounts of protein and small
molecule compounds are required because each compound is screened with a
single protein in a spatially segregated chamber. A library of protein-nucleic
acid fusion molecules, however, can be screened for binding interactions with
small molecule compounds in the same reaction chamber in a parallel fashion.
In addition, the protein target need not be cloned, overexpressed, or isolated,
but rather is screened as a protein-nucleic acid fusion molecule and identified
by its coding nucleic acid. Moreover, material costs may be further reduced by
miniaturization, which is facilitated by the present methods and is limited
solely by the choice of detection method for the identification of small
molecule-fusion complexes.

In addition, the present invention provides advantages in terms of the
time required to carry out a compound screen. In particular, the methods
described herein accelerate the identification of ligands (for example, small
molecule ligands) by screening a library of protein targets with a library of
potential ligands in a parallel fashion. In contrast to standard screens, where a
small compound library is screened for binding to different proteins in a
sequential fashion, small molecule compounds may be screened, in the present
techniques, with a library of protein-nucleic acid fusions in a single assay.
Consequently, the present invention facilitates the screening of members of a
library of small molecule compounds for binding to the members of a library of
proteins in a highly efficient manner.

Other features and advantages of the invention will be apparent from 
the following detailed description, and from the claims. 

Brief Description of the Drawings
FIGURE 1 is a schematic illustration of an exemplary approach to
screening a compound immobilized on a solid support with a library of
protein-nucleic acid fusions.

FIGURE 2 is a schematic illustration of an exemplary approach to
screening a library of compounds immobilized to beads with a library of
protein-nucleic acid fusions.

FIGURE 3 is a schematic illustration of an exemplary approach to
screening an addressable array of compounds immobilized on a microchip with
a library of protein-nucleic acid fusions.

FIGURE 4 is a graph illustrating compound binding to an
RNA-protein fusion on a bead solid support.

Detailed Description 
The methods of the present invention facilitate the efficient
identification of protein-compound (and, preferably, protein-small molecule)
binding interactions by screening such compounds with libraries of
protein-nucleic acid fusions (for example, protein-RNA fusions), thus providing a
catalog of compound-protein pairs. If desired, libraries of compounds may be
screened against libraries of protein-nucleic acid fusions in a single screen. In
preferred embodiments, either the compounds or the fusions are immobilized
on a solid support (for example, a bead, chip, glass slide, or column) to simplify
the screen and/or result readings. In addition, to facilitate the identification of
compound-protein pairs, the compound (or the solid support to which it is
immobilized) may be tagged with a detectable label characteristic of that
particular compound or compound family.

Any compound may be screened by the methods of the invention,
although small molecules represent preferred targets for screening.

These and other aspects of the invention are now described in more
detail below. These examples are provided for the purpose of illustrating the
invention, and should not be construed as limiting.

Screening Assays 

As discussed above, screening of compounds against protein-nucleic
acid fusions (for example, protein-RNA fusions) may be carried out in a
number of different formats. One particular format is illustrated in Figure 1.
By this approach, a single compound is immobilized on a column or any other
solid surface using any one of a variety of standard methods. The solid
phase-bound small molecule compound is then incubated with screening buffer
containing BSA or another inert protein to reduce non-specific binding. Next,
the buffer solution is removed, and the solid phase presenting the compound is
incubated with a solution of a protein-nucleic acid fusion library, followed by
washes with screening buffer to remove non-specifically bound fusion
molecules. Specifically bound protein-nucleic acid fusions are then eluted (for
example, by affinity elution using buffer containing free small molecule
compound). "Reading" the nucleic acid (for example, RNA) portion of the
eluted fusion molecules provides an identification of the protein that bound the
small molecule compound. Such a "reading" may be carried out as described
below.

Alternatively, multiple compounds may be screened simultaneously
against multiple protein-nucleic acid fusions. Two exemplary formats for
carrying out this type of screen are shown in Figures 2 and 3. In these formats,
an encoded (addressable) library of small molecules is immobilized on beads or
any other surface, such as a chip. The solid phase-bound library is then
incubated with screening buffer containing BSA or another inert protein to
reduce non-specific binding. Subsequently, the buffer solution is drained, and
the small molecule compound library is incubated with a fusion library,
followed by washes with screening buffer to remove non-specifically bound
molecules. Protein-nucleic acid fusion molecules specifically binding to small
molecules are then detected or, if a bead format is utilized, sorted and collected.
A reading code (or tag or address) is used to identify the small molecule
compound, and reading of the nucleic acid portion of bound fusion molecules is
used to identify the protein (as described below).
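
The two decoding steps described above (reading the code on the solid support to identify the compound, and reading the nucleic acid portion of the bound fusion to identify the protein) can be pictured as a pair of table lookups. The following Python sketch is illustrative only: the tag codes, compound names, sequences, and the `decode_hits` helper are hypothetical placeholders and are not taken from this application.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    """One bound complex recovered from the screen (hypothetical record)."""
    bead_tag: str    # code read from the bead or its address
    fusion_seq: str  # sequence read from the nucleic acid portion of the fusion


# Hypothetical lookup tables; a real screen would build these from the
# encoding scheme of the compound library and from sequence annotation.
TAG_TO_COMPOUND = {"TAG-001": "compound A", "TAG-002": "compound B"}
SEQ_TO_PROTEIN = {"ATGGCT": "protein X", "ATGAAA": "protein Y"}


def decode_hits(hits):
    """Return a catalog of (compound, protein) pairs for the given hits."""
    catalog = []
    for hit in hits:
        compound = TAG_TO_COMPOUND.get(hit.bead_tag, "unknown compound")
        protein = SEQ_TO_PROTEIN.get(hit.fusion_seq, "unknown protein")
        catalog.append((compound, protein))
    return catalog


if __name__ == "__main__":
    for pair in decode_hits([Hit("TAG-001", "ATGAAA"), Hit("TAG-002", "ATGGCT")]):
        print(pair)
```

Keeping the compound code and the protein sequence as two independent lookups mirrors the text: either identity can be read without knowing the other.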

Protein-nucleic acid fusion molecules of different genotypes and
different phenotypes can sometimes bind to the same small molecule
compound. If desired, therefore, the bound fraction of fusion molecules may
be collected, amplified, and reincubated with an identified ligand under more
stringent conditions (e.g., a lower concentration of protein-nucleic acid fusion).
This process may be repeated any number of times, allowing for the isolation of
a receptor with any desired ligand affinity (for example, selection for a receptor
having the highest affinity).
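
As a rough illustration of why repeated rounds of binding and amplification favor the highest-affinity receptor, the following Python sketch uses a toy model. It assumes simple 1:1 equilibrium binding, in which the fraction of each fusion that is retained is L / (L + Kd), followed by amplification that renormalizes the pool. The model, the `enrich` function, and all numbers are assumptions made for illustration; they are not data or methods from this application.

```python
def enrich(population, kd_nM, ligand_nM, rounds):
    """Toy selection model.

    population: dict mapping fusion name -> starting fraction of the pool
    kd_nM:      dict mapping fusion name -> dissociation constant (nM)
    ligand_nM:  effective ligand concentration (nM), assumed constant
    rounds:     number of bind/amplify cycles
    """
    pool = dict(population)
    for _ in range(rounds):
        # Fraction of each species retained on the immobilized ligand.
        bound = {
            name: frac * ligand_nM / (ligand_nM + kd_nM[name])
            for name, frac in pool.items()
        }
        # Amplification restores the pool size; only proportions change.
        total = sum(bound.values())
        pool = {name: amount / total for name, amount in bound.items()}
    return pool


if __name__ == "__main__":
    start = {"tight": 0.01, "weak": 0.99}   # hypothetical starting pool
    kd = {"tight": 1.0, "weak": 1000.0}     # hypothetical affinities (nM)
    for n in range(4):
        print(n, enrich(start, kd, ligand_nM=10.0, rounds=n))
```

In this model, lowering `ligand_nM` plays the role of a more stringent condition, because it widens the gap in retained fraction between tight and weak binders.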

In addition, once identified, a binding interaction between a solid
phase-bound compound and a fusion molecule may be confirmed or analyzed
by addition of free ligand or free protein to a compound-fusion complex in a
standard binding assay.
The present screens may be used to identify unknown
compound-protein interactions or may be exploited in circumstances where some general
knowledge of an interaction (for example, between a ligand and a receptor) is
available. In the latter case, biased libraries may be used for screening. Such
libraries may contain particular classes of compounds (or proteins) or
modifications of a single compound (or protein). In general, the biasing
element tends to increase the average affinity of a ligand for a target receptor
and to orient the ligand in a uniform way (see, for example, Chen et al., JACS
(1993) vol. 115, p. 12591-12592). This type of approach facilitates the
identification, for example, of ligands which bind to a receptor at a targeted
site.

Preparation of Protein-Nucleic Acid Fusions

As discussed above, the present techniques may be applied to any
population of protein-nucleic acid fusions, including protein-RNA fusions,
protein-DNA fusions, and fusions between proteins and hybrid DNA-RNA
molecules.

For use in the methods described herein, random libraries of
protein-RNA fusion molecules may be prepared, for example, as described in Szostak
et al., Selection of Proteins Using RNA-Protein Fusions, U.S.S.N. 09/007,005,
January 14, 1998, and U.S.S.N. 09/247,190, February 9, 1999; Roberts &
Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302; or
Kuimelis et al., Addressable Protein Arrays, U.S.S.N. 60/080,686, April 3,
1998, and U.S.S.N. 09/282,734, March 31, 1999. Alternatively, libraries of
cellular RNA-protein fusion molecules may be prepared from mRNAs or
cDNAs that lack 3'-untranslated regions, for example, as described in Lipovsek
et al. (Methods for Optimizing Cellular RNA-Protein Fusion Formation,
U.S.S.N. 60/096,818, August 17, 1998) and Hammond et al. (Methods for
Producing Nucleic Acids Lacking 3'-Untranslated Regions and Optimizing
Cellular RNA-Protein Fusion Formation, U.S.S.N. , August 17,
1999).

To label such protein-RNA fusions, any standard labeling method
and any detectable label (including, for example, radioactive, fluorescent, and
chemiluminescent labels) may be utilized. If desired, fusions may be
radioactively labeled by generating the fusion or fusion components in the
presence of radioactive amino acids (for example, 35S- or 14C-labeled amino
acids) or radioactive nucleotides (for example, 35S- or 32P-labeled nucleotides).
Alternatively, fusion molecules may be fluorescently labeled. In one particular
example, the DNA linker (for example, the dA27dCdCP linker described in
Roberts & Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p.
12297-12302) may be modified with a fluorescein phosphoramidite marker
(Glen Research, Sterling, VA), and this linker used for the synthesis of
fluorescent protein-RNA fusions. In yet another alternative, protein-RNA
fusions prepared according to the method of Roberts & Szostak (Proc. Natl.
Acad. Sci. USA (1997) vol. 94, p. 12297-12302; and Selection of Proteins
Using RNA-Protein Fusions, U.S.S.N. 09/007,005, January 14, 1998, and
U.S.S.N. 09/247,190, February 9, 1999) or cellular RNA-protein fusions
prepared according to the method of Lipovsek et al. (Methods for Optimizing
Cellular RNA-Protein Fusion Formation, U.S.S.N. 60/096,818, August 17,
1998) or Hammond et al. (Methods for Producing Nucleic Acids Lacking
3'-Untranslated Regions and Optimizing Cellular RNA-Protein Fusion Formation,
U.S.S.N. , August 17, 1999) may be labeled by base pairing the
fusion to a fluorescently-labeled oligonucleotide (for example, base pairing a
fluorescent poly-dT oligonucleotide to the dA27dCdCP linker).

Alternatively, protein-DNA fusions may also be labeled using
similar techniques. Such protein-DNA fusions may be generated as described,
for example, in Lohse et al., DNA-Protein Fusions and Uses Thereof, U.S.S.N.
60/110,549, December 2, 1998. In yet another alternative, the above labeling
techniques may be used for fusions of proteins to hybrid DNA-RNA portions
(i.e., one strand of each). Such hybrid fusions are generated, for example, by
subjecting an RNA-protein fusion to a step of reverse transcription using
standard techniques.

Preparation of Compounds 

For carrying out the screening methods of the invention, any
compound library may be utilized. Such libraries may be derived from natural
products, synthetic (or semi-synthetic) extracts, or chemical libraries according
to methods known in the art. Those skilled in the field of drug discovery and
development will understand that the precise source of compounds is not
critical to the screening procedure(s) of the invention. Examples of natural
compound sources include, but are not limited to, plant, fungal, prokaryotic, or
animal sources, as well as modification of existing compounds. Numerous
methods are also available for generating random or directed synthesis (e.g.,
semi-synthesis or total synthesis) of any number of chemical compounds,
including, but not limited to, saccharide-, lipid-, peptide-, and nucleic
acid-based compounds. Synthetic compound libraries may be obtained
commercially or may be produced according to methods known in the art.
Furthermore, if desired, any library or compound is readily modified using
standard chemical, physical, or biochemical methods.

In certain methods of the invention, interacting compounds are
identified as a result of a detectable label, or "tag," bound to either the
compound or its associated solid support (for example, bead). A coded library
of small molecule compounds may be prepared on beads as described, for
example, in Combs et al., JACS (1996) vol. 118, p. 287-288. In addition, a
number of encoding schemes are available, including peptide and nucleic acid
codes (Kerr et al., JACS (1993) vol. 115, p. 2529-2531; and Brenner & Lerner,
Proc. Natl. Acad. Sci. USA (1992) vol. 89, p. 5381-5383); chemical tags
(Ohlmeyer et al., Proc. Natl. Acad. Sci. USA (1993) vol. 90, p. 10922-10926;
and Maclean et al., Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 2805-2810);
fluorophore tags (Yamashita & Weinstock (SmithKline Beecham Corporation),
WO95/32425 (1995); and Sebestyen et al., Pept. Proc. Eur. Pept. Symp. 22nd
1992 (1993), p. 63-64); and radio frequency tags (Nicolaou et al., Angew.
Chem. Int. Ed. Engl. (1995) vol. 34, p. 2289-2291; and Moran et al., JACS
(1995) vol. 117, p. 10787-10788). Such labels may be read as described in the
references above.

Alternatively, an addressable library of compounds (for example,
small molecule compounds) may be prepared on a solid surface, such as a chip
surface. A variety of techniques are available for immobilizing compounds on a
chip surface, and any may be utilized. Preferable techniques include
photolithography (Affymetrix, Santa Clara, CA), mechanical microspotting
(Schena et al., Science (1995) vol. 270, p. 467-470; Synteni, Fremont, CA) and
ink jetting (Incyte Pharmaceuticals, Palo Alto, CA; and Protogene, Palo Alto,
CA).



Identification of Compound-Fusion Interactions

To identify interactions between compounds (for example, coded
compounds) and protein-nucleic acid fusions, any method may be utilized
which provides a means for detecting a label associated with the compound or
fusion or, if appropriate, for isolating and determining the identity or "address"
of the compound-fusion pair.

In one particular example, compound-protein pairs (for example,
small molecule-protein pairs) may be isolated and identified on beads. To
detect a label associated with a bead, the bead resin is preferably plated out,
followed by scanning, for example, for a fluorescent or radioactive label (using,
for example, a Phosphorimager to detect a radioactive label). Protein-nucleic
acid fusion molecules binding to small molecules presented on a bead may be
isolated by physically sorting the beads. Alternatively, beads bound to
fluorescently labeled fusion molecules may be sorted on a fluorescence
activated cell sorter (FACS). Selected beads may be individually and spatially
separated (for example, into the wells of a 96-well microtiter plate). For
RNA-protein fusions, molecules bound to individual beads may then be identified by
reverse transcription of the RNA portion, followed by sequencing of the DNA
as described by Roberts & Szostak (Proc. Natl. Acad. Sci. USA (1997) vol. 94,
p. 12297-12302) and Szostak et al. (Selection of Proteins Using RNA-Protein
Fusions, U.S.S.N. 09/007,005, January 14, 1998, and U.S.S.N. 09/247,190,
February 9, 1999). The tag coding for the compound (for example, the small
molecule compound) on each individual bead may be read as described above.
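
Once the DNA copy of the RNA portion has been sequenced, the protein is identified by conceptually translating the coding sequence. The Python sketch below shows that step using the standard genetic code. The `translate` helper is hypothetical, and the example DNA sequence is only one of many possible encodings of the DYKDDDDK-ASA tag, chosen here for illustration; the application itself lists only the peptide sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(coding_seq):
    """Translate a DNA (or RNA) coding sequence into one-letter amino acids.

    Translation starts at the first base and stops at the first stop codon
    or when fewer than three bases remain.
    """
    seq = coding_seq.upper().replace("U", "T")
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # Hypothetical encoding of the affinity tag used in the working example.
    print(translate("GATTATAAAGATGATGATGATAAAGCTAGCGCT"))
```

In practice the translated sequence would then be compared against known protein sequences to name the bound protein; that comparison is outside the scope of this sketch.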

Alternatively, ligand-receptor pairs on a chip surface may be
detected by scanning the chip surface for radioactivity or fluorescence. The
address of the interacting pair on the chip reveals the identity of the compound
(for example, the small molecule compound). The fusion molecule may be
picked from the chip surface using an addressable microcollector or any other
standard method (see, for example, Kuimelis et al., Addressable Protein Arrays,
U.S.S.N. 60/080,686, April 3, 1998, and U.S.S.N. 09/282,734, March 31,
1999). The retrieved fusion molecule may then be identified by characterizing
the nucleic acid portion of the fusion as described above.
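
For the chip format, the compound identity follows directly from the position at which signal is detected. The Python sketch below illustrates that lookup. The grid layout, the signal values, the threshold, and the `find_hits` helper are all hypothetical; no particular scanner output format is implied.

```python
def find_hits(signal, layout, threshold):
    """Return (address, compound, intensity) for every spot at or above threshold.

    signal:    2D list of scanned intensities, indexed as signal[row][col]
    layout:    dict mapping (row, col) -> compound immobilized at that address
    threshold: minimum intensity treated as a binding event
    """
    hits = []
    for row_index, row in enumerate(signal):
        for col_index, intensity in enumerate(row):
            if intensity >= threshold:
                address = (row_index, col_index)
                hits.append((address, layout[address], intensity))
    return hits


if __name__ == "__main__":
    # Hypothetical 2 x 2 array and scan result.
    layout = {(0, 0): "compound A", (0, 1): "compound B",
              (1, 0): "compound C", (1, 1): "compound D"}
    signal = [[5, 120],
              [7, 3]]
    print(find_hits(signal, layout, threshold=100))
```

The address returned for each hit is also the position from which the bound fusion would be retrieved for identification of its nucleic acid portion.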

Compound Screening Utilizing a Bead Format

As described above, compounds may be immobilized on a bead solid
support and used to screen for protein-nucleic acid fusions, and specifically for
RNA-protein fusions, which are capable of interacting with the compound. In
one particular working example of this approach, the dihydrofolate reductase
(DHFR) gene was cloned out of a human liver cDNA library (Maxim Biotech,
South San Francisco, CA). The construct contained the entire DHFR gene with
an added C-terminal DYKDDDDK-ASA peptide tag (SEQ ID NO: 1).
RNA-protein fusions of DHFR were prepared by PCR amplification of the DHFR
coding sequence followed by fusion formation as described in Roberts &
Szostak (Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302) and
Szostak et al. (Selection of Proteins Using RNA-Protein Fusions, U.S.S.N.
09/007,005, January 14, 1998, and U.S.S.N. 09/247,190, February 9, 1999).
The fusions were purified using oligo-dT-cellulose affinity chromatography
(Edmonds et al., Proc. Natl. Acad. Sci. USA (1971) vol. 68:1336) and reverse
transcribed with Superscript II reverse transcriptase according to the
manufacturer's instructions. 100 fmol of DHFR fusion in 10 1X buffer
(Phosphate buffered saline, 1 M NaCl, 1 mg/ml BSA, 0.1 mg/ml sheared DNA,
1% v/v Triton X-100) was combined with 10 pre-equilibrated
methotrexate-agarose (as described in Kaufman, Methods Enzymol. (1974) vol. 34:272-81)
in a 500 µL eppendorf tube. The slurry was incubated for 30 minutes at
ambient temperature with mixing every 5 minutes. The slurry was then
centrifuged for 1 minute at 3000 rpm in an eppendorf microfuge. The liquid
was removed, and the methotrexate-agarose was washed 3 times with 500
of 1X buffer. The fusions were then eluted by incubation of the
methotrexate-agarose in 50 µL 30 methotrexate for 30 minutes at 37°C.

The results of this interaction assay are shown in Figure 4. In this
figure, the percent of total fusion molecules was monitored by measuring
35S-methionine label incorporated into the fusions during the translation step. As
indicated, the third wash contained no significant amount of fusion molecules.
In addition, of the total amount of fusion included within the matrix, 86%
flowed through the bead column, and the other 14% was efficiently eluted with
methotrexate.
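
The reported percentages can be related back to the 100 fmol of fusion used in the working example with a simple mass balance. The Python sketch below does that arithmetic. It assumes that the radiolabel signal is proportional to the amount of fusion and that the full 100 fmol input is accounted for by the flow-through and eluted fractions; the `fmol_from_percent` helper is hypothetical.

```python
def fmol_from_percent(input_fmol, percent):
    """Convert a percentage of the input into an absolute amount in fmol."""
    return input_fmol * percent / 100


# Values taken from the working example above.
INPUT_FMOL = 100
FLOW_THROUGH_PERCENT = 86
ELUTED_PERCENT = 14

flow_through_fmol = fmol_from_percent(INPUT_FMOL, FLOW_THROUGH_PERCENT)
eluted_fmol = fmol_from_percent(INPUT_FMOL, ELUTED_PERCENT)

if __name__ == "__main__":
    print(f"flow-through: {flow_through_fmol} fmol")
    print(f"eluted with methotrexate: {eluted_fmol} fmol")
    print(f"accounted for: {flow_through_fmol + eluted_fmol} of {INPUT_FMOL} fmol")
```

Under these assumptions, roughly 14 fmol of the DHFR fusion was retained on the methotrexate-agarose and recovered by affinity elution.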
The present methods provide an efficient means for screening either
small or large libraries for compound-protein binding interactions. In addition,
these methods may be utilized to screen protein-nucleic acid fusions against
one compound or against a library of compounds.

Commercial uses for screening a library of fusion molecules against
a single compound include, without limitation, identification of a protein binder
for a desired small molecule from a random pool of fusion molecules,
rationalization of the mechanism of action of a given drug by isolating the
cellular target from a pool of cellular mRNA-protein fusion molecules (or a
pool of the DNA-protein fusion or hybrid fusion derivatives), and
rationalization of the side effect profile of a given drug by isolating most or all
target proteins from a pool of cellular mRNA-protein (or DNA-protein or
hybrid-protein) fusion molecules, leading to an improved drug with reduced
side effects.

Uses for screening a library of fusion molecules against an encoded
(addressable) library of compounds include, without limitation, screening a
library of small molecule compounds with a library of nucleic acid-protein (for
example, cellular mRNA-protein) fusion molecules for potential new lead
compounds (for example, ligands or enzyme inhibitors), screening a library of
nucleic acid-protein (for example, cellular mRNA-protein) fusion molecules
with a library of small molecule compounds for potential targets (for example,
receptors or enzymes), and mapping of binding interactions between the
members of a protein library and the members of a small molecule compound
library, thus providing a catalogue of ligand-protein pairs.

All patents and publications mentioned herein are hereby 
incorporated by reference. 

Other embodiments are within the claims. 
What is claimed is:

Claims

1. A method for detecting a compound-protein interaction, said
method comprising:

(a) providing a compound library in which each member of said
compound library is immobilized on a solid support;

(b) contacting each member of said immobilized compound library
in a single reaction chamber with each member of a protein-nucleic acid fusion
library under conditions which allow the formation of compound-fusion
complexes;

(c) isolating said immobilized compound-fusion complexes; and

(d) detecting said compound-fusion complex as an indication that the
protein of said fusion interacts with said compound.

2. The method of claim 1, wherein said protein-nucleic acid fusion is
a protein-RNA fusion, a protein-DNA fusion, or a protein fused to a
DNA-RNA hybrid molecule.

3. The method of claim 1, wherein said solid support is a bead.

4. The method of claim 3, wherein each said bead is coded with a 
unique detectable label. 

5. The method of claim 4, wherein the compound of said complexed
protein-nucleic acid fusion is identified by said unique detectable label
associated with said bead.

6. The method of claim 4, wherein said detectable label is a peptide
label, a nucleic acid label, a chemical label, a fluorescent label, or a radio
frequency tag.

7. The method of claim 1, wherein said solid support is a chip and
said compound library is immobilized on said chip in an addressable array.

8. The method of claim 7, wherein each member of said
protein-nucleic acid fusion library is detectably labeled.

9. The method of claim 1, wherein said compound-fusion complex,
or the components thereof, are recovered by release from said solid support.

10. The method of claim 1, wherein said method further comprises
recovering said protein-nucleic acid fusion from said solid support and
identifying said protein.

11. The method of claim 10, wherein the identity of said protein is
determined from the sequence of the nucleic acid portion of said
protein-nucleic acid fusion.

12. The method of claim 1, wherein said compound is a small
molecule.

13. A method for detecting a compound-protein interaction, said
method comprising:

(a) providing a compound immobilized on a solid support;

(b) contacting said immobilized compound with a protein-nucleic
acid fusion library under conditions which allow said fusion to bind to said
compound; and

(c) detecting a bound protein-nucleic acid fusion as an indication that
the protein of said protein-nucleic acid fusion interacts with said compound.

14. The method of claim 13, wherein said protein-nucleic acid
fusion is a protein-RNA fusion, a protein-DNA fusion, or a protein fused to a
DNA-RNA hybrid molecule.

15. The method of claim 13, wherein said protein-nucleic acid 
fusion is detectably labeled and said interaction is indicated by the association 
of said detectable label with said solid support. 

16. The method of claim 13, wherein said bound protein-nucleic
acid fusion is recovered by release from said solid support.

17. The method of claim 13, wherein said method further comprises 
recovering said protein-nucleic acid fusion from said solid support and 
identifying said protein. 

18. The method of claim 17, wherein the identity of said protein is
determined from the sequence of the nucleic acid portion of said
protein-nucleic acid fusion.

19. The method of claim 13, wherein said solid support is a column,
glass slide, chip, or bead.

20. The method of claim 13, wherein said compound is a small
molecule.
SEQUENCE LISTING

<110> Phylos, Inc.

<120> IDENTIFICATION OF COMPOUND-PROTEIN INTERACTIONS USING LIBRARIES
OF PROTEIN-NUCLEIC ACID FUSION MOLECULES



<130> 50036/017WO2 

<150> 60/096,820 
<151> 1998-08-17 

<160> 1 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Synthetic affinity tag 



<400> 1
Asp Tyr Lys Asp Asp Asp Asp Lys Ala Ser Ala
 1               5                  10